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Classification Method and Apparatus 

The invention relates to a method and apparatus for classifying items. 
The invention is concerned especially with the classification of coins or 
5 banknotes. 

Coins and banknotes inserted into mechanisms, such as vending 
machines, change machines and the like, are classified, on the one hand 
according to value, and/or on the other hand, between originals and copies or 
counterfeits thereof. Various methods of performing such classifications are 

10 known. As one example, described in GB 2 238 152 A, the contents of which 
are incorporated herein by reference. For example, measurements are taken 
from an inserted coin which represent different features of the coin, such as 
material and the thickness. Those measurements are then compared with 
respective stored pairs of values, each set of pair of values corresponding to a 

15 respective acceptable denomination of coin. When each measured value falls 
within the respective range for a given denomination, the inserted coin is 
classified as belonging to that denomination. 

In the type of classification discussed above, the measured values can 
be regarded as elements in a feature vector, and the acceptable measurements 

20 for different denominations correspond to regions in feature space, known as 
acceptance regions. In the example given above, the feature space is two- 
dimensional, and acceptance regions are rectangles, but the feature space can 
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have any number of dimensions, with corresponding complexity in the 
acceptance regions. For example, GB 2 254 949 A, the contents of which are 
incorporated herein by reference, describes ellipsoidal acceptance regions in 
three-dimensional feature space. 
5 Other examples of methods and apparatus for classifying bills and 

coins are described in EP 0 067 898 A, EP 0 472 192 A, EP 0 165 734 A. 
Other methods of classification include the use of neural networks, as 
described, for example, in EP 0 553 402 A and EP 0 671 040 A, the contents 
of which are also incorporated herein by reference. 

10 A significant problem in the classification of coins is the difficulty of 

separating different denominations. The population distributions of the 
different denominations of interest may be such that it is not possible easily to 
define appropriate acceptance boundaries with which adequately separate the 
denominations. Another problem is that in order to achieve adequate 

15 separation, it may be necessary to consider feature vectors having a large 
number of elements, which makes it more difficult to understand the various 
distributions and thus more difficult to obtain suitable acceptance boundaries. 
These problems are akin to general classification problems in data analysis 
which has been studied and have led to various different techniques including 

20 statistical methods. 

As an example of a statistical method of data analysis, principal 
component analysis ("PCA"), is a method whereby data expressed in one 
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space is transformed using a linear transformation into a new space, where 
most of the variation within the data can be explained using fewer dimensions 
than in the first space. The method of PCA involves finding the eigenvectors 
and eigenvalues of the covariance matrix of the variables. The eigenvectors 
5 are the axes in the new space, with the eigenvector having the highest 
eigenvalue being the first "principal component" and so on in decreasing size. 
Details of PCA can be found in textbooks on multivariate analysis, such as 
"Introduction to Multivariate Analysis" by Chatfield and Collins, see Chapter 
4. 

10 Another method of data analysis for classification purposes is linear 

discriminant analysis ("LDA"). LDA is useful when it is known that the data 
falls into separate groups. LDA aims to transform the data into a new space 
so as to maximize the distance between the centre of each group of data as 
projected onto axes in the new space and also to minimize the variance of 

15 each group along the axes. Methods for doing this are described in, for 
example, "Introduction to Statistical Pattern Recognition" by Fukunaga 
("Fukunaga"). In one example, the maximisation is performed by finding a 
linear transformation which maximises the value of the trace of C~'v where V 
is the inter-class covariance matrix and C is the covariance matrix of all 

20 samples . As explained in Fukunaga, this amounts to finding the eigenvectors 
and eigenvalues of C'v . The eigenvectors are the axes of the new space. As 
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described in the paper, when there are N classes, the new space has N-l 
dimensions. 

In many situations, neither PCA nor LDA will give adequate 
separation of the groups of data. A further method of data analysis is non- 
5 linear component analysis (NCA), which is based on PCA. In NCA, the data 
is projected into a new space using a non-linear mapping, and then PCA is 
performed in the new space. Details of NCA are given in the article 
"Nonlinear component Analysis as a Kernel Eigenvalue Problem" by 
Bernhard Scholkopf, Alexander Smola and Klaus-Robert Muller, Neural 

1 0 Computation 1 0, 1 299- 1319(1 998). ("Scholkopf.) 

A problem with NCA is that the dimension of the non-linear space 
may be very large, and so the number of principal components is also very 
large. For a given problem, it is not known how many principal components 
are needed for a good classification. 

1 5 Generally, the invention relates to a method of deriving a classification 

for classifying items of currency comprising measuring known samples for 
each class and deriving features vectors from the measured samples, mapping 
the feature vectors to a second space in which there is a clearer separation of 
the different classes and deriving a separating function using the separation in 

20 the second space. 

More specifically, the present invention provides a method of deriving 
a classifier for classifying items of currency into two or more classes 
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comprising measuring known samples for each class and deriving feature 
vectors from the measured samples, selecting a function corresponding to a 
mapping of the feature vector space to a second space, mapping feature 
vectors to image vectors, and deriving coefficients representing N-l axes, 
5 where N is the number of classes, in the second space, obtaining values 
representing the projections of the image vectors for the measured samples 
onto the N-l axes, and using those values to derive a separating function for 
separating the classes equivalent to a separating function in the second space. 
The invention also provides a method for classifying an item of 

10 currency comprising measuring features of the item, generating a feature 
vector from the measured values, and classifying the item using a classifying 
derived by a method according to any one of claims 1 to 6. 

The invention also provides an apparatus for classifying items of 
currency comprising measuring means for measuring features of an item of 

1 5 currency, feature vector generating means for generating a feature vector from 
the measured values, and classifying means for classifying the item using a 
classifier derived according to the method of any one of claims 1 to 6. 

The invention also provides an apparatus for classifying items of 
currency comprising measuring means for measuring features of an item of 

20 currency, feature vector generating means for generating a feature vector from 
the measured values, and classifying means for classifying the item using a 
function corresponding to a non-linear mapping of the feature vector space to 
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a second higher-dimensional space, mapping feature vectors to image vectors, 
and coefficients representative of N-l axes, where N is the number of classes 
that can be classified by the apparatus, in the second space, and a function 
equivalent to a separating function in the second space. 
5 An embodiment of the invention will be described with reference to 

the accompanying drawings of which: 

Fig. 1 is a block diagram of a classification system. 

Fig. 2 is a graph showing a distribution of coin data; and 

Fig. 3 is a graph showing a projection of the data of Fig. 2 onto new 

10 axes. 

The invention will be described with reference to a coin validator. 

In Fig. 1, box 1 designates a measuring system which includes an inlet 
2, a transport system in a form of a coin inlet and coin transport path (not 
shown) for presenting a sample 3 and a sensor system (not shown) for 
15 measuring physical quantities of the sample. The measuring system 1 is 
connected to a processing system 4 by means of a data bus 5. Processing 
system 4 is connected to a classifier 6 by means of a data bus 7. The output of 
the classifier 6 is connected to a utilization system 8 by means of a data output 
bus 9. The utilization system 8 is in this example a vending machine, but may 
20 also be, for example, a money exchange machine. 

The measuring system 1 measures features of an inserted coin 3. The 
measured features are assembled into a feature vector having n elements, 
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where each element corresponds to a measured feature by the processing 
system 4. In the present example, the sensor system measures values 
representative of the material, thickness and diameter of an inserted coin, 
using known techniques (see, for example, GB 2 254 949 A) and those values 
5 are the three elements of the corresponding feature vector. Briefly, each 
sensor comprises one or more coils in a self-oscillating circuit. In the case of 
the diameter and thickness sensors, a change in the inductance of each coil 
caused by the proximity of an inserted coin causes the frequency of the 
oscillator to alter, whereby a digital representation of the respective property 

10 of the coin can be derived. In the case of the conductivity sensor, a change in 
the Q of the coil caused by the proximity of an inserted coin causes the 
voltage across the coil to alter, whereby a digital output representative of 
conductivity of the coin may be derived. Although the structure, positioning 
and orientation of each coil, and the frequency of the voltage applied thereto, 

15 are so arranged that the coil provides an output predominantly dependent upon 
a particular one of the properties of conductivity, diameter and thickness, it 
will be appreciated that each measurement will be affected to some extent by 
other coin properties. 

Of course, many different features representative of items of currency 

20 can be measured and used as the elements of the feature vectors. For 
example, in the case of a banknote, the measured features can include, for 
example, the width of the note, the length of the note, and the intensity of 
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reflected or transmitted light for the whole or part of the note. As an example, 
a measuring system can be arranged to scan a banknote along N lines using 
optical sensors. Each scan line contains L individual areas, which are scanned 
in succession. In each area, there are measurements of M different features. 
5 More specifically, for each area, measurements are made of the reflectance 
intensities of red, green and infra-red radiation. The total number of 
measurements for a banknote is therefore L x M x N. These measurements 
form the components of a feature vector for the respective specimen, so that 
the feature vector has L x M x N components. Alternatively, the 

10 measurements can be processed in a different way to obtain a feature vector 
representative of the measured specimen. For example, local feature vectors 
for each measured area can be formed made up of the M measurements for 
that area, so that each local feature vector has M components. The local 
feature vectors can then be summed over the area of the banknote to obtain an 

1 5 M dimensional feature vector representative of the entire specimen. 

The feature vector is then input to the classifier 6. The classifier 6 
determines whether the sample belongs to any one of predetermined classes, 
using the feature vector and predetermined classification criteria including a 
separating function. If the sample is identified as belonging to an acceptable 

20 denomination of banknote, then it is accepted and the corresponding value of 
the note is credited. If the sample is identified as belonging to a known 
counterfeit group, it is rejected. 
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In this example, the system is for classifying two denominations of 
coins and one known counterfeit. A two-dimensional representation of the 
distribution in measurement space is shown in Fig. 2. The crosses represent 
samples of the first denomination, the dots represent counterfeits of the first 
5 denomination and the circles represent samples of the second denomination. 

The derivation of the separating function will be described below in 
general terms. The method of classification will then be described, also in 
general terms, followed by an explanation of the application of the general 
method to the specific example. 
10 Briefly, a method for deriving a separating function according to an 

embodiment of the invention maps the input space, that is the space of the 
measured feature vectors, using a non-linear map, into a higher dimensional 
space with linear properties. Separating hyperplanes are constructed in the 
mapped space using training data, using the equivalent of an LDA analysis in 
15 the mapped space. 

The population distribution of the denominations are analysed as 
discussed below. 

Initially, samples of each of the denominations of interest and each of 
the known counterfeit are measured and corresponding feature vectors are 
20 formed. The feature vectors from the samples, when plotted, for example, on 
a n-dimensional scatter graph, (where n is the number of measured features) 
fall roughly into groups, known as clusters. These measured samples are then 
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used to derive a separating function, as described below. In this example, 50 
samples for each denomination and 50 samples of the counterfeit, are used. 

Before proceeding further, a general explanation of the notation used 
is provided. 

5 The input space, that is, the space of feature vectors, is defined as X. 

N 

X = U X < ' where N is the nun iber of clusters. The cardinality of subspace X, 

N 

is denoted by n,, and the number of elements in X is M. Thus ]>]«, = M. x' 

is the transpose of vector x. 

In the input space, C is the covariance matrix, and 

c-jjiy, a) 

The method of the invention uses a kernel function k defining a dot 
product in a mapped space. Suppose <j> is a non-linear function mapping X 
into a Hilbert space F. 

<|>:X->F 

15 x-><|>(x) (2) 

and k(x,y) = <(. (x). <|> (y) = (x) <|> (y) 

As will be clear from the following discussion, it is not necessary 
explicitly to construct for a given k, although it can be shown, by Mercer's 
20 theorem, if for any k is a continuous kernel of a positive integral operator 
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which is positive, then a § exists (see SchSlkopf Section 3 and Appendix C). 
Nor is it necessary to perform dot products explicitly in F, which may be an 
infinite dimensional space. 

In F, V is the covariance matrix, and 

5 y=-~f i ^ l ^'(x J ) ( 3) 

We assume that the observations are centred in F, that is, that 
2 &( x j ) = 0 • A method of centering data will be described later. 
B is the covariance matrix of the cluster centres, and 
B = -^T n <W' (4) 
1 0 where <p, is the mean value of the cluster 1, that is 

^ (5) 
where x,j is the element j of the cluster I. 

B represents the inter-cluster inertia in F. 
V can also be expressed using the clusters as 

15 r-4;it«**WM (6) 

V represents total inertia in F. 
Letk u = k{x i ,x j ) 

^(kij^^^XpOKXqj)) 
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Let K be an (MxM) matrix defined on the cluster elements by J • 

where (K pq ) is the covariance matrix between cluster p and cluster q . 
*«(^),-....* where K„ = (M<-..% (8) 

K pq is a (n p x n q ) matrix 
and K is symmetric so that K' K = K n 
W is the matrix centre, and 

W = (W,) M .. iN (9) 

where W, is a (n, x n^ matrix with all terms equal to — . 

n, 

W is a M x M block diagonal matrix. 

The method essentially performs linear discriminant analysis in the 
mapped space F to maximise inter-cluster inertia and minimise the intra- 
cluster inertia. This is equivalent to eigenvalue resolution, as shown in 
Fukunaga. A suitable separating function can then be derived. 

More specifically, the method involves finding the eigenvalues X and 
eigenvectors v that satisfy 

A,Vv = Bv (10) 

The eigenvectors are linear combinations of elements of F and so there 
exist coefficients a„ q (p=l...N, q=l...np) such that 

v = £i> (ID 

P-I 9-1 

The eigenvectors of equation (10) are the same as the eigenvectors of 
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A0'(x u )Vv = 0 l (xi i )Bv (12) 
(see SchSlkopf). 

Using the definitions of K and W, and equations (6) and (1 1), the left- 
hand side of (12) can be expressed as follows: 

M p=l ,7-1 M *=l J 
and (xir)^«=^2]£^^(^)Z£#(»)[f («•) 

^ />=! »-l /-I *=l 1 J 

Using this formulate for all clusters i and for all elements j we obtain: 

W(x u ) f(x h ) 4'(x g ),...j , (x„X--J'(x f *„)Wv = 4; KKa 

M 

ff = (a»);-i...» 

where 

*-'••■», 
= (a^)„-i..jv 
where a P = ( a Jq = l... Bj> 

Using equations (4), (5) and (1 1), for the right term of (14): 

-ii|«4[i^.>I^#'^)^)] 
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and *'(*,)2*=^Zi> w i[^ <*(* w )j 

For all clusters i and for all elements j we obtain: 
(# , (*,,),..M#'(^)v.M#'(x # ),...,^(* wl ),...,^(x Jfc J)^=^JOra:a (14) 

Combining (13) and (14) we obtain: 
A,KKa = KWKa 
_. , a' KWKa 

"~7kk^ < 15 > 

K can be decomposed as K = QR (Wilkinson, 1971) so that Ka = 

QRa. 

R is upper triangular and Q is orthonormal, that is Q l Q = I. 

Q is a Mxr matrix and R is a rxM matrix, where r is the rank of K. It 
is known that the QR decomposition always exists for a general rectangular 
matrix. 

Then, let Ra = p (16) 
As the rows of R are linearly independent, for a given p, there exists at 
least one a solution. 

Hence Ka = QP and a'K = pV (K is symmetric). 
Substituting in (15) 
. a'KWKa 

Q is orthonormal so 1$ = Q'WQP (18) 



00/33262 PCT/IB99/02012 
15 

Equation (18) is in the form of a standard eigenvector equation. As K 
is singular, the QR decomposition permits work on a subspace QP, which 
simplifies the resolution. 

Then the coefficients a can be derived from p from equation (16), and 
then the eigenvectors from equation (1 1). 

These coefficients a are normalised by requiring that the 
corresponding vectors v in F be normalised. That is: 

vV=l (19) 

or (from equation 11) 

=a'Ka 

so(19)=>a'A:a=l (20) 

The steps given above set out how to find the eigenvectors v of 
equation (10). 

As its known from linear discriminant analysis (see, for example, 
Fukunaga), the number of eigenvectors = N-l where N is the number of 
clusters. The image of the clusters in the subspace spanned by the 
eigenvectors is found by projecting onto the eigenvectors. This is done using 
the following equation: 
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for an eigenvector v, and a feature vector x. 



(21) 




As can be seen from the above, the calculation does not require 
knowledge of <|>, or the need to calculate a dot product in F. 



function, the images of the clusters in the eigenvector subspace are well- 
separated and, more specifically, may be linearly separable, that is they can be 
separated by lines, planes or hyperplanes. 

Then a suitable separating function can easily be derived for 

10 classifying measured articles, using a known technique, such as inspection, 
averaging, Malalanobis distance, comparison with k nearest neighbours. 

As mentioned previously, it was assumed that the observations are 
centred in F. Centering will now be discussed in more detail. Firstly, for a 
given observation x a : element j of the cluster i, the image ftx^) is centered 

15 according to: 



5 



It has been shown in experiments that by use of a suitable kernel 




(22) 



We have then to define the covariance matrix K with centered points: 



(■ k u)pq = )) for a given cluster p and q. 
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J N N i N N 

Where we have introduced the following matrix: 
I pt =(hk )i=i r ^i^=i._^„ ( n P xn i) matrix whose elements are all equal to 1. 
In = Upi) p .\,..,n,i=i„n,(MxM) matrix whose elements are block 
matrices. 

10 Thus, for non-centred points <Kxy), we can derive Kfrom K and then 

solve for the eigenvectors of K . Then, for a feature vector x, the projection 
of the centred <(>-image of x onto the eigenvectors v is given by: 

p-\ q=\ 

The above discussion sets out in general terms the method of general 
15 discriminant analysis. The general principles will now be illustrated with 
reference to the specific example of the coin validator. 

Returning to the example of the coin validator at the beginning of the 
description, the feature vectors each have three elements, and there are three 
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clusters, corresponding to each of the two denominations of interest and the 
known counterfeit respectively. 

50 samples of each denomination and 50 samples of the counterfeit are 
input to the measuring system 1. As previously mentioned, the sensor 
systems measures samples to obtain values representative of the thickness, 
material and diameter in each case. Corresponding feature vectors are formed 
from the measured features for each sample. 

From the 50 samples feature vectors for each cluster, 37 are randomly 
selected for use in generating the separating function. 

A kernel function is then chosen. The kernel function is chosen on the 
basis of trial and error so as to choose whichever function gives the best 
separation results. There are a large number of kernel functions, satisfying 
Mercer's theorem, which may be suitable. Examples of kernel functions are 
the polynominal kernel: 

k(x,y) = (x.y) d ; 

the Gaussian kernel: 



k (x,y) = exp 



Q\x-y\\ 2 ) _ 



a' 



the hyberbolic tangent kernel: 



k (x,y) = tanh ((x.y)+6); and 



the sigmoid kernel: 



k(x,y)=(- 



1 



-)• 



+ e 



,-«'■?)+<!) 
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In this example, the Gaussian kernel is used, with a 2 = 0.01 . 

Using the selected samples and the kernel function, the matrices K and 
W are calculated. (Equations (8) and (9)). 

Then K is decomposed using QR decomposition. 

Then eigenvectors p and corresponding eigenvectors are calculated 
(equation (18)). 

Then coefficients a are calculated and normalised (equations (16) and 



Thereafter, the feature vectors of the remaining 13 samples for each 
cluster are projected onto the eigenvectors v (equation 21) and the results are 
plotted on a graph for easy inspection. In this example, there are 3 clusters, so 
there are 2 eigenvectors, and separation is in 2-d space. This is shown in Fig. 
3. As can be seen, the clusters are well-separated. More specifically, each 
cluster is projected on one point, which is the gravity centre. The separation 
of the projection of the clusters with the eigenvectors is then analysed, and 
used to derive a separation function. In this example, a linear separating 
function can easily be derived by inspection. For example, a suitable 
separating function is: 

for eigenvectors v b v 2 

and an input vector x 



(20)). 




PCT/IB99/02012 

20 



x belongs to group I (that is, it is of the first 

denomination); 

if[(f Wv / )>0and( ( *'(x)v 2 )<0] 

then x belongs to group 2 (that is, it is of the second 

denomination); and 

if (x)v, ) < 0 and {</>' (*)v 2 ) > o] 

then x belongs to group 3 (that is, it is a counterfeit of 
the first denomination). 

Classification for coins of an unknown denomination is then 
performed as follows. The inserted coin is sensed, and measurements 
representative of the material, thickness and diameter are obtained, as for the 
samples. A feature vector is then derived from the measured values. The 
feature vector is then projected onto the calculated eigenvectors (using 
equation 21) and the coin is classified in accordance with the projection 
values and the separating function, as described above. 

The analysis of the sample values for the initial data analysis and the 
derivation of the separating function can be done, for example, using a 
microprocessor. Similarly, the classifier 6 may be a microprocessor. 

As an alternative, the classifier 6 may be a neural network, such as a 
probabilistic neural network, or a perceptron. For example, the neural 
network may include N-l linear output neurones and M hidden neurones, 
where every kernel computation is a hidden neurone. Then the input weights 
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are the values x pq , and the coefficients a are the weights between the hidden 
neurones and the output layer. 

Also, the classifier may be a linear classifier, or a Support Vector 
Machine. 

5 The methods of the embodiment described above are equally 

applicable to a banknote or indeed to a classification of other sorts of items. 

Other methods of solving (10), for example by decomposing K using 

eigenvector decomposition, are possible. 

In the embodiment, a non-linear mapping to a higher-dimensional 
10 space is used. A linear mapping could be used instead. Also, mapping could 

be to a lower-dimensional space, or to a space of the same dimension as the 

feature vector space. 
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Claims 

1. A method of deriving a classification for classifying items of 
currency into two or more classes comprising measuring known samples for 

5 each class and deriving feature vectors from the measured samples, selecting a 
function corresponding to a mapping of the feature vector space to a second 
space, mapping feature vectors to image vectors, and deriving coefficients 
representing N-l axes, where N is the number of classes, in the second space, 
obtaining values representing the projections of the image vectors for the 
10 measured samples onto the N-l axes, and using those values to derive a 
separating function for separating the classes equivalent to a separating 
function in the second space. 

2. A method as claimed in claim 1 wherein the mapping is a non- 
15 linear mapping. 

3. A method as claimed in claim 1 or claim 2 wherein the second 
space is higher-dimensional than the first space. 



20 



4. A method as claimed in any one of claims 1 to 3 wherein the 
coefficients are derived by optimising the separation of the groups of image 
vectors for each class with respect to the axes. 
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5. A method as claimed in any one of claims 1 to 4 comprising 
deriving a matrix V where V is the covariance matrix in the second space and 
a matrix B where B is the covariance matrix of the class centres in the second 

5 space, deriving the solutions to the equation KV\=Bv, and deriving said 
coefficients from the solutions v. 

6. A method as claimed in any one of claims 1 to 5 wherein said 
function expresses a dot product in the second space in terms of a function on 

1 0 two elements of the feature vector space. 



7. A method as claimed in claim 6 wherein said function is k(x,y) 
where k(x,y) = (x.y) d . 
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8. 



A method as claimed in claim 6 wherein said function is k(x,y) 



where k(x,y) = exp- 
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9. A method as claimed in claim 6 wherein said function is k(x,y) 
where k(x,y) = tanh ((x.y) + 9). 
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1 0. A method as claimed in claim 6 wherein said function is k(x,y) 
where k(x,y)= (T —l^). 

11. A method for classifying an item of currency comprising 
measuring features of the item, generating a feature vector from the measured 
values, and classifying the item using a classifying derived by a method 
according to any one of claims 1 to 10. 



12. An apparatus for classifying items of currency comprising 
10 measuring means for measuring features of an item of currency, feature vector 
generating means for generating a feature vector from the measured values, 
and classifying means for classifying the item using a classifier derived 
according to the method of any one of claims 1 to 10. 

15 13. An apparatus for classifying items of currency comprising 

measuring means for measuring features of an item of currency, feature vector 
generating means for generating a feature vector from the measured values, 
and classifying means for classifying the item using a function corresponding 
to a mapping of the feature vector space to a second space, mapping feature 

20 vectors to image vectors, and coefficients representative of N- 1 axes, where N 
is the number of classes that can be classified by the apparatus, in the second 
space, and a function equivalent to a separating function in the second space. 
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14. An apparatus as claimed in claim 13 wherein the classifying 
means comprises means for deriving values representing the projection of the 
image of the feature vector of the measured item onto the or each axis. 

15. An apparatus as claimed as claimed in any one of claims 12 to 
14 wherein the classifying means comprises a neural network. 



16. An apparatus as claimed in any one of claims 12 to 15 
10 comprising a coin inlet and the measuring means comprises sensor means for 
sensing a coin. 



17. An apparatus as claimed in claim 1 6 wherein the sensor means 
is for sensing the material and/or the thickness and/or the diameter of a coin. 

18. An apparatus as claimed in any one of claims 12 to 14 
comprising a banknote inlet and wherein the measuring means comprises 
sensor means for sensing a banknote. 
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19. An apparatus as claimed in claim 18 wherein the sensor means 
is for sensing the intensity of light reflected from and/or transmitted through a 
banknote. 
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20. A coin validator comprising an apparatus as claimed in any one 
of claims 12 to 17. 

21. A banknote validator comprising an apparatus as claimed in 
any one of claims 12 to 14 or 18 or 19. 
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W.ai^/tli, (y,. ifrh/i/z,} (j< 
i) £ftl£LT, !y,, ^h/Pz.l fcJHW6e 2 
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U i!ft***>&T) lCfc-3T**«itt*ff, 
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[1] y.=+l*fcti-l*?y*7»icltf, 
;i/ z . ^7 y ^flicir^TSo 40 
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[3] ^7h;l/z,£[S£L/c3;£> FfcSSIclSWc 
ffiKStfSy.fcmW*. 
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oTttfitwastttf was**, tmriid u xa 

y , m&lKZt^Yb z , *«ft?*ft»SK. I IW 

nmsF^«'>Lrci8^ at y , *« ik^us t tic 
y.iiOJ&aacfcttTtaiv ^oafta. ofcis 

t-TSfc {^b/t'Z,, y,) (DF^r«4 

[0 0 4 8] ft*, K7Wl/z,, y.l <D 
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±ie«r— *-fc«y hT'lt, M(ta$2 i ftft 
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*lf, iiuftlW'The Nature of Statistical Learning Th 40 
eury" . WO X th rs ft „ fiftlO-Xft N i It , & 2 7 7 X 

* j 7->7i"frcJtLTiitK8WTf i ommitm 

/Mryhfttt, WJSOfflfcHilifc/uifftL'e. CO«iS 
{fc£iMlft 0 1 OfflOf ^^r-fTiSrSfci&T 1-3 
«1 0^7X7 3^77*7 (cntCO^TJt, 
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[0 0 5 3] (&4b\ mH, 7z£% 1 OffiON I S T 

icttLxm&n. 1 o%0«it»*i?5ofig0saHfc 

^ftjiflfto C. J. C. Burges, B. Schoelkopf, "Impro 
vingthe Accuracy and Speed of Support Vector Marhi 
nes", in press, NIPS '9(i, ) 
[00 54] 

1 0 0T', A^Jillliif-^^SVM (H7fvef) 
^n§„ 7srv7l 0 57, SVMtf£»Aaf-*K 
WLTlW^tl, Xf77l I Of, SVMtttftf-r- 
^^ h;K7)-t7 H^^BS-rSo XT77°1 3 5T', ^/Jn 
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mix) %.mmm. cos^vh^rW' 
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mmt%mimi\c7j<t, ztv/z i 5?, k 
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3„ *T7:/2 2 5T', SVM(4, flfcajfrS, 
[0 0 5 6] J'Biit, mttO«'Jvb-y b-^ Hi 

&Wmo)T-$j)\ miBiO'Mi'll/h-by b^* b 

077n-**ffl^T, tffcft^fc'hS^ttJvfe'y b 

ftgT'feSo x<mtLT, x«©IWvfcyMjr 

b;Mi, mflOJSf'Jvfe'y b'S* bMHry 

tiSc C«J8£, v><o ( x ) nm^vY^VYb* 

[0 0 5 7] /<*->3B»0lSS<0, *JW30**/j0!> 30 

■fa-tyVl 0 5 feet 1 1 0*^ftij, .IgJ&ifg 
1 1 014, r-*AJjfi*l 1 5, feit/SVMl 2 0 

'Ml 1 514, tt!B*Sfci&W3r-**SVMl 2 
O'NiSS. T-*A7jS£l 1 5 

c«18&, A?JT-*!4$jfi«t^-fe;UiUI (H* 
iff) T'&So SVM1 2 0ti, *aWOKatcfl£oT 

ft (m) SVM1 2 0(4, Aflr-*0#S*« 
■fttlSISS*. fttt©fflil0fc»fc:/n-fcytfi 0 5(c 
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7n-b7+M 0 5(4, fflAlf, **»J*1HW 
-7 a 7"n f 7 tf« J; b mm? a f 7 iW/D -b -y 
T'&5 0 Tn-fc-ytf 1 0 5(4. 2^(c, {HAtiflftiSfl, 

1 (ATM) ft£K*tt«£W8l l o«mwi»;*» 
«*«„ 

[0 0 5 8] H5 0->Xri»li2O«€-F, t4fe 
5, Jll^-b'fcitffift (MSD *-FTftft*5. 

[0 0 5 9]tt.t, *«Wfc-5^T»»SLft*«, "f»fi 
(cS4IEI2ft3J; d fc, *»W«SffiW«BirtT*S 

M*.tf, *ftUI«Jf 

[00 60] 

£ntf, Ifrhtv Wh/l/OljRfciO, ttlllifflff 

■9-sK— b^* b;l/7/l/rf'J X2»tfffli^ti8 <fc 5 4ft j» 
©IBM (0ljK.(f, |h1»«IS) tafflajSlT**, 

im i ] textttBo s vMommmwT'hz* 

m 2 ] ttftfrtf- h ^ h /Wc «fc 0 itttr- * * 2 o 
<0^7^fc»(IT*-IS««:iaT***, 
CEJ3] *»WOK«t<:«eoT S VMvXf A*f|«lt 

[H4] *^<DRatBeot S VMy7fA*i(|fl=-S 

etas] *wBoKa**«f«wii'X7 i i»o-aw3 

•yDy^^T-feS, 
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